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Abstract. Zeeman-Doppler Imaging (ZDI) is a powerful inversion method to reconstruct stellar 
magnetic surface fields. The reconstruction process is usually solved by translating the inverse 
problem into a regularized least-square or optimization problem. In this contribution we will 
emphasize that ZDI is an inherent non-linear problem and the corresponding regularized opti- 
mization is, like many non-linear problems, potentially prone to local minima. We show how 
this problem will be exacerbated by using an inadequate forward model. To facilitate a more 
consistent full radiative transfer driven approach to ZDI we describe a two-stage strategy that 
consist of a principal component analysis (PCA) based line profile reconstruction and a fast 
approximate polarized radiative transfer method to synthesize local Stokes profiles. Moreover, 
we introduce a novel statistical inversion method based on artificial neural networks (ANN) 
I I . which provide a fast calculation of a first guess model and allows to incorporate better physical 
■ constraints into the inversion process. 
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1. Introduction 



Zeeman-Doppler Imaging (ZDI) was pioneered by ISemel ( 1989I ). although first de 



scribed as a mere detection method for surface magnetic fields on r apid rotating stars , 
the method quickl y evolved to a real mapping (inversion) technique ( Donati et al. IQQOl : 



^—1 ■ tne metnoa quiCKi y evoivea to a real mappmg (mversionj tecnnique (|Uonati et ai-lliljijU : 
[ Brown et al.lll99ll ). Today ZDI or Magnetic-Doppler Imaging (MDI) is a synonym for 



the inversion of time and phase resolved spectropolarimetric observations to reconstruct 
, stellar magnetic surface fields. Since that time ZDI was a success story of its own and 
has enormously contributed to our current understanding of stellar magnetism (e.g 
Donati e t al.lll997l:lDonatilll999HHussain et al.l20o3lKochukhov et al.l200llDonati et al 



2006, 2007r Petit et al.1 2008f ). However, there are a number of fundamental problems 



m 

o : 

which has to be considered when dealing with ZDI as an inverse problem. One of these 
' problems comes from an inadequate formulation and approximation of the forward prob- 
^ , lem while another comes from the inherent non-linearity of the inverse problem. 



2. The forward problem 

Inverse problems arise whenever one searches for causes of observed data or desired 
effects. The most general way of solving the inverse problem (directly or indirectly) is 
to utilize a model of the underlying forward problem. The forward problem constitutes 
our physical theory and/or knowledge about the system to make predictions about the 
outcome and result of measurements. The direct or forward problem in ZDI is typically 
the polarized radiative transfer based on a model of the stellar atmosphere. Here we may 
encounter the first fundamental problem, owing to our limited theoretical knowledge 
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(e.g. lack of atomic and/or atmospheric parameters) as well as numerical and compu- 
tational constraints we are restricted to an approximate description of the real physical 
system. Critical in this context is to provide an appropriate and adequate parameteriza- 
tion of the model , i.e the minimal number of free variable that determine the measur- 
able outcome. The determination of a particular parameterization is not always obvious 
and requires a careful evaluation. Formally the forward problem of ZDI can be writ- 
ten in_a^_com£a£t_wB^;j_b};_^^ of the Stokes vector as described 
bv lLandi Deglinnocenti fc Landi Deglinnocentil ( 1985h . This allows us to write the disk- 



integrated Stokes vector /* at a wavelength A and for a particular rotational phase 4> as 
follows : 



7*(A,0)= / / O{O,s{M',0),X{M'),M',(l)) j {s{M',e),X{M'),M',cl))dscosedM. 
Jm Jo 

(2-1) 

The vector X provides the parameterization of the model and comprises parameters like 
the magnetic field vector, temperature, abundance, etc., dM denotes the infinitesimal 
surface element at the position M' (given in spherical coordinates), 9 denotes the angle 
between the surface normal at M' and the direction to the obse rver, s is the geometrical 
path length towards the observer, j is the emission vector (see iStenflolil994l ). and O is 
the evolu tion operator of the formal solution which incorpor ates the absorption matrix. 
FoUowing lLandi Deglinnocenti fc Landi Deglinnocenti (1985), we can write the evolution 



operator, in the case of a piecewise constant absorption matrix K, as 

0(s, s', M, X{M)) = e--f<^(A^^^ . (2.2) 

Equation 12.11 can also be written symbolically in a more compact form as 

r(A,0) = nix), (2.3) 

where n represents the formal non-linear integral operator of the forward problem. Sev- 
eral things must be kept in mind here, both integrals, the outer surface integration over 
the model atmosphere as well as the inner transport integral through the atmosphere, 
must be discretized in order to cope with the problem numerically. We should be aware 
that this already means that we transform our originally continuous problem into a 
discrete one with finite dimension. As mentioned above, the finite parameterization of 
our model X{M) deserves particular attention because the limitation to those variables 
that we consider as necessary and sufficient, will directly determine the dimension of 
the model space, we will see later how the neglect of the temperature may have drastic 
consequences for the inversion. The radiative transfer model we use is still formulated 
in local thermodynamic equilibrium (LTE) which is quite sufficient for the majority of 
photospheric spectral lines we are using but certainly has its limitations. Moreover, it is 
customary to describe the atmosphere as height independent, i.e. neglecting all height 
gradients in most of the model parameter, an approximation which becomes questionable 
in particular for stars with extended atmospheres like giants. All these approximations of 
the real physical situation in stellar atmospheres will therefore have direct consequences 
on the inversion process. Finally, it should be also clear from equation (|2.ip and (|2.2p 
that our forward problem is in general non-linear in X, an aspect that will be discussed 
further in the following section. 

3. The inverse problem 

The retrieval of model parameters from measured data constitutes the general defini- 
tion of a inverse problem. In the particular case of ZDI the measured data are given in 
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the form of phase resolved Stokes profile observations. In general there is a distinction 
between continuous and discrete inverse problems which means that we want to retrieve 
either a continuous functions or a parameterized model. In that sense it is already the 
formulation of the forward problem which defines the kind of inverse problem. The same 
holds for the definition of linear or non-linear inverse problems, which is determined by 
the linearity or non-linearity of the forward problem. Inverse problems, like ZDI, are 
often said to be ill-posed by which, in general, we mean that i.) the inverse problem 
might be non-unique and ii.) the inverse problem might not depend continuously on the 
data, i.e. the solution is unstable against small perturbation of the observed data. The 
non-uniqueness may have different reasons which can be a real intrinsic non-uniqueness 
of the problem, or uncertainties in the observed data, or an ill-defined forward model, 
or a lack of data that sufficiently constrain the problem. The problem of ill-posedness 
can be effectively addressed by a regularization approach, i.e. adding additional a priori 
assumptions (i.e. constraints) to the inverse operator. A powerful method in this context 
is the Tikhonov regularization but many other forms of regularizations are also possible. 
For linear problems it c an indeed be show n that the regularization leads to a unique and 
stable solution (see e.g. Engl et al. 1996f) . However, the forward problem we deal with 



in ZDI is non-linear, and at first, it is not obvious how this will affect the uniqueness 
and stability of our solution. The general way to proceed is to formulate and minimize a 
regularized least-square problem. This approach, relies on the linearization of the forward 
operator, which can be obtained in our case from a first order expansion of (|2.3p . 



n[X) = 0(Xo) + J(X - Xo) , (3.1) 

where J is the Jacobian matrix of O. Using the sum of square error and a regularization 
functional G{X) the objective functional F{X) can be written as 

F{X) = \\Iobs-r\\^ + aG{X) (3.2) 

where a is the regularization parameter which determines the influence of the regular- 
ization. The regularization functional G can be expressed in the from \\L{Xref — -X^o)P 
where X^ef is a parameter vector which represents our a priori knowledge (assumption) 
about the solution, and L is a suitable matrix approximation of a differential operator. 
The essence of the non-linear solution is then to repeat the process of solving the lin- 
earized solution i n terms of a grad ient descent method. Using a Gauss-Newton type form 
of linearization (.Engl et al. 19961 ) we obtain from (|3.2[) for an increment AXn+i at the 



iteration n + 1 the following solution. 



AX„+i - (J* J + aL*Ly' {J*{Iobs - I*)) + aL*L{Xref - X„) , (3.3) 

where J* is the conjugate transpose of J. The stabilizing effect of the regularized solution 
is then ensured by the positive definiteness of the inverse {J* J -f aL*L)~^. As the 
regularization parameter is held constant throughout the iteration, the regularization 
part in (j3.2p will have an increasing infiuence as we approach the solution and thus 
provides the additional benefit of preventing the inversion from fitting arbitrary details 
(i.e. noise) in the spectra. However, although we gain a stabilizing effect at each iteration 
step, it should be clear from (|3.2[) that even when G provides a convex constraint this 
needs not necessarily be true for F{X) which, in general, is still a non-convex function 
due to the non-linearity of p.ip . The inverse problem may therefore have many local 
minima in the parameter space in which a gradient descent method can get stuck and a 
regularization alone can not guarantee to find the unique global minimum. Furthermore, 
the determination of an appropriate regularizati on parameter a is by no means an easy 
task and can require high computational efforts ( Engl et al. 19961 ). 
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Figure 1. The mercator plot of the magnetic field distribution : (top) the synthetic reference 
model, (middle) the reconstructed magnetic field from a simultaneous temperature and magnetic 
field inversion, (bottom) the result from the restricted inversion (fixed temperature). 



4. More than one solution — an inadequate forward model 

To show how much the solution of the ZDI problem depends on the forward model 
and the chosen parameterization we made a simple synthetic experiment. The stellar 
model surface consists of a large bipolar magnetic field region. The stellar atmosphere is 
that of a typical fast rotating K subgiant star (i.e. \og{g) = 3.5, T^ff = 4600 K, [M/H] 
= -0.5, vsini = 40 km/s). One of the magnetic regions is located in a cool spot like 
structure while the other polarity is embedded in a hot and more dispersed like structure 
to mimic a faculae or plage region, see top of Figure [2] The magnetic field is exclusively 
radial oriented in both magnetic regions. Two inversion were performed, the first one 
worked on the basis of the correct parameterization with the magnetic field vector and 
temperature as free parameter, while the second inversion runs with a fixed temperature. 
This allows the second inversion only t o seek in a subspac e for a possible solution. For the 
inversion we have our ZDI code iMap (jCarroU et al.ll2007( ) with a surface segmentation of 
1800 elements, and the Zeeman-sensitive iron line Fel A 6173. Although we have all four 
Stokes components available we restricted the number of available Stokes components 
to Stokes / and Stokes V of the Zeeman-sensitive iron line Fel A 6173. We have added 
noise with a signal-to-noise level of 1000 to the original model spectra. Both inversion 
run with the same value for the regularization parameter. The first inversion converged 
to a solution that is close to the reference field structure, see Figure [T] middle. The 
surface structure is well reconstructed from the simultaneous temperature and magnetic 
field inversion and also the Stokes V spectra (Figure [21 left) are well reproduced. In 
the second inversion where we have neglected the temperature as a free parameter the 
fit to the reference spectra is again very good but the resulting magnetic surface field 
is drastically different (Figure [l] bottom) . The surface field shows a characteristic high 
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Figure 2. The fits (daslied lines) to tiie noisy reference Stokes V profiles from the two test 
inversions : on the left the fits of the simultaneous temperature and magnetic field inversion, 
Xrsd = 1-01 and on the right the fits of the restricted inversion, xled = 1-02. 



latitude band of a unipolar azimuthal field with an longitudinal extension of about 180'^ 
which is in sharp contrast to the original reference field. The inversion in the restricted 
parameter regime has apparently converged to a false solution, although the fits are 
very good (Figure [H right). The quality (goodness) of the fit (Figured]) itself do not 
provide enough information in this case to distinguish between the competing solutions. 
This issue could readily be resolved if the corresponding Stokes Q and U profiles are 
taken into account (not shown here). Due to the critical role of the temperature and 
the associated radiative transfer effects it will be of particular importance to provide an 
accurate and appropriate forward model for the inverse problem. We will investigate the 
impact of radiative transfer eff ects and inadequate modeling for ZDI inversions in greater 



detail in a forthcoming paper (| Carroll et al 



quate m 



5. Towards a full polarized radiative transfer approach 

We have seen from the foregoing synthetic example how the underlying model can affect 
the outcome of the inversion. Moreover we have seen the critical role of the temperature 
which is of course tightly correlated to the radiative transfer effects. It is therefore worth- 
while to include a full polarized radiative transfer in the inversion process to make the the 
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problem setting as realistic as possible. There are at least two major obstacles towards 
a full radiative transfer driven approach : 

• the noise level of individual Stokes line profiles, and 

• the computational demands of an iterative inversion. 

The first point refers to the cases that most observed and measured Stokes V, Q and U 
spectra are heavily contaminated by noise. Although there exist p owerful multi- l ine re - 



construction techniques as the least-square deconvolution method of lDonati et al.l (|l997r ), 
they do not easily allow one to apply a radiative transfer modeling approach. The cru- 
cial point is that this technique only retrieve a kind of mean line profile which has no 
straight forward interpretation on the basis of spectral line synthesis. The second point 
highlights the computational effort one is facing by the iterative approach to the inverse 
problem. To address both of the above listed problems and to pave the way for a full 
radiative transfer modeling approach we propose a two-step strategy, a multi-line recon- 
struction method which enables the reconstruction of individual Stokes profiles and a 
fast approximate radiative transfer calculation by artificial neural networks. 

5.1. Multi-Line PC A reconstruction 

The basic idea of the multi-line PCA reconstruction technique is to extract the system- 
atic features within the observed data set, i.e. Stokes profiles, and at the same time to 
reduce the noise. This is done by identifying the redundancy within the dataset and 
describing the dataset with a minimal number of characteristic parameter (i.e. dimen- 
sions) as possible. The redundancy among the observed data can be described by the 
correlation between the individual spectra such that we can express the correlation of 
entire observational dataset by the covariance matrix Cx which can be written, after 
transforming the spectra into the velocity domain, as 

n 

where v — cAX/X and n is the number of individual spectral line profiles used for the 
analysis and X the mean Stokes profiles of all spectral lines. We then seek a set of or- 
thonormal direction in the data space (given by the spectral resolution) along which the 
variance in X is maximized in descending order. These new set of coordinate axes which 
accounts for the maximum variance in the observed data (Stokes spectra) can be deter- 
mined by calculating the eigenvectors of the covariance matrix Eq. (|5.ip . The so called 
Principal Component Analysis (PCA) or Karhuncn-Loeve transformation ( Bishop Ill995h 



will provide us with the calculation of the eigenvectors of the covariance matrix of the 
observation whereby the eigenvectors are ordered according to their associated eigenval- 
ues (variances). We use the PCA method to decompose the entire set of observed Stokes 
spectra into a new coordinate system. This procedure will project the most coherent and 
systematic features in the observed Stokes profiles into the first few eigenvectors with the 
largest eigenvalues while the incoherent features (i.e. noise or blends) will be mapped to 
the less significant eigenvectors (with low eigenvalues). After having calculated the set 
of orthogonal eigenvectors we can use this new basis to decompose all observed Stokes 
spectra into the new basis of eigenvectors ui , as 

Xk{v) = Yak,iui{v) , (5.2) 



where ak,i = Xk{v)ui{v) is the scalar product (i.e. the projection or the cross-correlation) 
between the observed k — th Stokes profile Xk{v) and the l — th eigenvector ui{v). If we use 
a limited number of the first few I eigenvectors for this decomposition the reconstruction 
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will be made only with those components who carry the most significant information 
about the systematic features and inherent line characteristics. We therefore reconstruct 
individual Stokes profiles to an extent that the majority of the individual line character- 
istics are preserved and only a small contribution of uncorrelated effects are present. To 
determine the maximum number of meaningful eigenvectors we calculate the covariance 
matrix of pure noise profiles with a S/N ratio that corresponds to the noise level of the 
observations. This allows us to estimate the magnitude of the eigenvalues from which the 
corresponding eigenvectors will mainly contain noise. Since the PCA multi-line method 
now allows the reconstruction of individual line profiles, all the known line parameters 
of a particul a r spe ct ral line can be used in a subs equent DI and ZDI inversion. See 
Carroll et all (|2007t) : iMartmez Gonzalez et all (|2008l) for a more detailed description of 



the method. 

5.2. A fast Stokes profile synthesis 

The basic idea of our proposed approach is to emulate the process of polarized line 
formation by using an adaptive regression model, that is fast to evaluate, and which 
provides the required accuracy. The adaptive model we seek must provide a sufficient 
complexity to describe the non-linear mapping of Eq. (|2.ip . between the most prominent 
atmospheric parameters and the resulting Stokes spectra. For this purpose, we used a 
supervised machine learning algorithm, e.g., an artificial neural network (ANN) model. 
A popular type of of ANN, also used in this work, is the so called multilayer-perceptron 
(MLP) which is known as a universal function approximator. The MLP can be regarded 
as a class of nonlinear function, which performs a continuous and multivariate mapping 
between an input vector x and an output vector y. The network function represents a 
function composition of elementary non- linear functions g{a). These elementary functions 
are arranged in layers whereby each of these functions in one layer is connected via an 
adaptive weight vector w to all elementary functions in the neighboring layers. The l~th 
output (component of the output vector y) of a two layer (of weights) MLP for example 
can be written as 



yi{x]w) ^gk I J2^kj9j [ J2^J^^^'' ] ) ' (^'^^ 

\i=0 



where Xi represents the i-th component of the input vector x and icj]^^ the connecting 
weight from the i-th input component to the j-th elementary function gj in the first unit 

(2) 

layer. The weights wj^j then connecting all the functions gj with the functions gk in the 
second unit layer. The capital letters (I, J) giving the numbers of elementary functions 
(units) in the respective unit layer. The elementary functions g{a), which are also called 
activation functions, are given by the following type of sigmoid function, 

= . • (5.4) 

1 + exp(~a) 

The network function y{x) will thus process a given input vector x by propagating this 
vector (via multiplication with the individual weight values and subsequent evaluation 
of the different activation functions) through each layer of the network. The particular 
function that will be implemented by the MLP is determined by the overall structure 
of the network and the individual adaptive weight values. The process of determining 
these weight values for the MLP is called (supervised) training and is formulated as 
a non-linear optimization process. This training is performed on the basis of a repre- 
sentative dataset, which includes the input to target (i.e. training output) relations of 
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the underlying problem. This process is similar to a non-linear regression for a given 
data set, but as the underlying model (i.e., MLP) is much more general, the regression 
function is not restricted to a specific predetermined (or anticipated) model. In fact, it 
can be shown th at MLPs provi de a general framework for approximating arbitrary non- 
linear functions ([Bishop lUiii). In our case the input-to-target relation is dictated by 



our synthesis problem, for the input vectors we have chosen the following atmospheric 
parameters : the temperature and pressure structure of model atmospheres (described by 
the effective temperature), logarithmic gravitation, iron abundance, the local bulk veloc- 
ity of the plasma, microturbulence, macroturbulence, magnetic field strength, magnetic 
field inclination, magnetic field azimuth and the LOS angle between the observer and the 
local normal. For the output parameter we have used the full Stokes vector profiles of the 
Zeeman-sensitive iron lines Fel A 6173 A and Fel A 5497 A which have an effective Lande 
factor of (?cff = 2.5 and geS = 2.25 respectively. Once the network is successfully trained 
and has converged in terms of minimizing the error between the calculated output vectors 
and the target vectors of the training database, the network weights are frozen and the 
MLP, which now represents the desired approximation of the underlying problem, can 
be applied to new and unknown data coming from the same parameter domain as the 
training data. To test the accuracy of the trained MLPs we created a large number of 
input vectors with randomly chosen combinations for the atmospheric parameters. The 
statistical evaluation shows that the MLPs are able to calculate the corresponding Stokes 
profiles with a high degree of accuracy. The comparison of the MLP calculation with the 
results from the conventional numeric al integration of th e polarized radiative transfer, 
performed with the DELO method of Rees et al. ( 19891 ). demonstrates the impressive 
results of the MLP synthesis. For the Stokes I profile calculation we obtain a rms error 
as low as 0.11 %, for Stokes V 0.17 %; and for Stokes Q and U, slightly above 1 % rela- 
tive to the DELO solution. The MLP has in fact learned to disentangle the different and 
sometimes competitive effects of the various atmospheric input parameters to calculate 
accurate local Stokes profiles for both iron lines. But it is not only the accuracy of the 
method which makes this synthesis method attractive for ZDI inversion, it is first and 
foremost the speed of the calculation. A benchmark test with several workstations con- 
firmed that the MLP synthesis is more than a factor 1 000 f aster then the conventional 
numerical approach, for more details see Carroll et al. ( 2008[) . 



6. An artificial neural network approach to ZDI 

In this section we want to introduce our new statistical inverse approach to ZDI which 
is based on artificial neural networks. Instead of using the ANNs in a forward model- 
ing approach as in the preceding section, we now try to use ANNs to provide a direct 
approximation of the inverse mapping between the observed Stokes spectra and the cor- 
responding magnetic surface distribution. Again we rely on the non-linear approximation 
capabilities of the multilayer-perceptron as given in Eq. 15.31 to find a continuous approx- 
imation of the underlying inverse probler n. ANNs have been already successfully used 
in solar Stokes profile inversion problems (ICarroll fc Staudell200ll: ICarroll fc Kopll2008h 
which have shown their great potentials in approximating the inverse mappings between 
observed Stokes profiles and the underlying atmospheric parameters. One of the key is- 
sues in the following ZDI inversion is the reduction of the input (i.e. Stokes spectra) as 
well as the output dimension (i.e. surface images) by a PCA decomposition. The learning 
task of the neural network is then to approximate the inverse mapping in the reduced 
eigenspace between the eigenprofiles and the corresponding eigenimages of the surface. 
The reduction to the subspace of eigenspectra and images has two main reasons, first, we 
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Figure 3. The reconstructed radial magnetic field of II Peg for the year 2007. In the top the 
reconstruction based on the conventional inversion is shown, whereas, the ANN inversion based 
on the — Q, dynamo model is shown in the bottom. 



reduce the topology of the MLP structure which allows a more stable and faster conver- 
gence in the training process and second, the restriction to a subspace of the problem, has 
a regularizing effect on the solution due to the reduced redundancy in the input spectra 
and the noise reducing effects by describing the solution as a superposition of eigenim- 
ages. Because the transformation and training of the network is based upon a training 
sample, this will have a decisive impact on the resulting eigenspectra and eigenimages. A 
finite training sample will only provide a good statistical modeling in a limited region of 
the parameter domain and an approximation of the inverse function will therefore only 
provide meaningful results in t hat restricted doma in. To apply the ANN inversion on 
real observed Stokes V spectra (jCarroll et al.ll2007l ) we trained our MLP on synthesized 



Stoke s profiles which are based on a theoretical a — dynamo IjElstner &: Korhonen 



20051) . The ~ Vl dynamo originally developed to study and explain the appearance of 



flip-flop events on active stars provides not only a good model repre sentation since ou r 
target star II Pegasi is known to be a candidate for an flip-flop cycle (|Berdvuginall2007l ) . 
it also allows us to incorporate (implicitly) the physics provided by this model into our 
inversion approach. From this simulation we have synthesized approximately 1000 Stokes 
V profiles at different phases and evolution stages of the dynam o. Assuming typical stel- 
lar and atmospheric parameters for the subgiant K2 star II Peg dBerdvugina et al."l998h 
we have synthesized approximately 1000 disk-integrated Stokes V profiles for different 
phases and different evolution stages of the dynamo simulation. The training process fol- 
lows again a large-scale optimization of the network parameters of the MLP to find the 
best possible approximation for the inverse mapping between the decomposed Stokes V 
profiles and the corresponding decomposed surface images After the training process the 
observed II Peg Stokes V spectra applied to the trained MLP to estimate the magnetic 
surface structure. In Figure[3]we see a comparison of the surface magnetic field structure 
of II Peg retrieved with the conventional ZDI inversion made with our ZDI code iMap 
and the new ANN inversion. The surface structure inferred from this first ANN approach 
is in remarkable good agreement with the conventional inversion. The good convergence 
of the training process and the good results of the inversion indicate that the MLP has in 
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fact found a well-behaved inverse mapping to that problem and moreover it shows that 
the dynamo model used for this study provides a reasonable assumption for the surface 
structure observed on II Peg. A mor e detailed descriptio n of this novel ANN approach 



will be given in a forthcoming paper (jCarroll et al. II2009I) 



7. Summary 

Zeeman-Doppler imaging is an inherent non-linear problem even with an appropriate 
regularization, it is not assured that we arrive at a global minimum, we are still fac- 
ing the problem of local minima. This is particular problematic in combination with an 
ill-defined or inadequate forward model. As was shown in Sect. IH even if the observed 
spectra are well reproduced, the solution can be far from the true solution. The fit to 
the observed Stokes V profiles does often not provide enough information to distinguish 
between competing solutions. The additional availability of Stokes Q and U profiles with 
the next gene ration of spectropolarimeter (PEPSI) at the 8.4 m Large Binocular Tele- 
scope (LBT) ( Strassmeier et al. 2007 . 20081) will immensely help to further mitigate this 
problem. But for all the current observations where only Stokes V spectra are available 
we have to rely on the best possible forward modeling. In an effort to provide a more 
systematic radiative transfer modeling we have introduced a new PCA based multi-line 
reconstruction technique. This method allows us to recover Stokes profiles of individual 
spectra lines and thus facilitates their modeling by means of radiative transfer calcu- 
lations. Moreover, we have introduced a fast Stokes profile synthesis which is not only 
accurate but accelerates the process of calculating the Stokes profiles by more than three 
orders of magnitude. In a preliminary study we have investigated the capabilities of a new 
artificial neural network approach to ZDI. ANNs have a number of favorable features, 
they provide a direct inversion in terms of approximating the inverse mapping between 
the observed Stokes profiles and the magnetic surface distribution. And moreover, ANNs 
as a statistical learning tool, allows one to incorporate theory based knowledge in the 
training process which is then utilized in the subsequent inversion. Although further in- 
vestigations are needed, the ANN approach already provides a new and promising way 
of a direct inversion or for a hybrid approach with a conventional ZDI inversion. 
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